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INDEXED LIBRARY OF RECOMBINANT CONGENIC NON-HUMAN ANIMALS 

1.0. Field of the Invention 

The present invention relates to indexed libraries of 
5 recombinant congenic non-human animals, and methods for 
producing the same. The animals comprising the library are 
useful for the generation of offspring with specific genetic 
traits and phenotypes, as well as the study of the genetic 
basis of disease. 

10 

2.0. Background of the Invention 

Standard breeding practices have been developed and used 
over centuries of trial and error to develop a wide variety 
of domesticated animal breeds. The genetic similarity of the 

15 animals of a given breed results in a population of animals 
that, as a group, tend to have fairly uniform and predictable 
behavioral and growth characteristics. Unfortunately , 
classical "pure breed" animals do not breed true, and a 
significant proportion of the progeny of "pure breed" crosses 

20 often display a wide variety of abnormalities as an adverse 
consequence of poorly controlled in-breeding. 

Unlike "pure-bred" animals, inbred strains comprise 
animals that have been bred to contain essentially identical 
genetic content. Consequently, animals within a given inbred 

25 strain may generally be expected to breed true. Once inbred 
animals had been developed, the next phase in animal genetics 
involved the genetic construction of strains that contained 
discrete portions of the genetic material from one strain in 
the context of the genetic background of a second strain. 

30 Such strains are typically referred to as "congenic strains". 
Congenic animals have widely benefitted scientific 
research in fields such as immunology. Commercially 
available congenic mice generally have a relatively minor 
portion of the genome from the donor strain as compared to 

35 the genetic "background" strain. Consequently, even when 

considered in total, existing congenic strains only represent 
about a third of the mouse genome. Accordingly, only a 
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limited percentage of the mouse genome may be studied using 
the congenic mouse strains that are presently available. 

3.0. Summary of the Invention 

5 The present invention is directed to methods for 

systematically constructing a novel library of non-human 
recombinant congenic animals. More particularly, two 
relatively divergent (and preferably highly divergent) inbred 
strains of animals are identified or constructed, and these 

10 strains are then used to construct a library of recombinant 
congenic animals that collectively transfers a comprehensive 
set of the donor strain genome into the background strain. 
Generally, a reciprocal series of crosses shall complete the 
library by producing a substantially inverse set (i.e., 

15 reversing the relative "donor" and "backgrounds" of the first 
series of crosses) of recombinant genetic animals. 

Accordingly, one embodiment of the present invention is 
a library of non- human recombinant congenic animals 
comprising animals having genomes containing about 1.5 to 

20 about 10 percent donor parent genome. In a preferred 

embodiment, the presently described libraries shall comprise 
recombinant congenic strains having discrete and overlapping 
segments of donor genetic material that collectively 
constitute a substantially complete representation of the 

25 donor parent genome. 

Another aspect of the present invention are libraries of 
"indexed" or "genome tagged" recombinant genetic animals. 

Another embodiment of the present invention is the use 
of the animals in the recombinant congenic library to 

30 identify and analyze the genetic basis of disease (or absence 
thereof) . More particularly, an appropriately constructed 
recombinant congenic library allows for the identification 
and study of the gene or genes associated with a given 
phenotype . Accordingly, an additional aspect of the present 

35 invention is the generation and use of recombinant congenic 
animals that comprise regions of the donor genome that are 
sufficiently specific or defined to allow for fine structure 
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genetic mapping of phenotypic traits via the generation and 
characterization of animals that are congenic for specific 
genetic loci. 

Additional embodiments of the claimed invention are 
5 methods of screening recombinant genetic libraries by 
phenotypic screening followed by genetic fine structure 
mapping of the genetic basis for the selected phenotype . 

An additional embodiment of the claimed invention is a 
recombinant congenic animal that essentially comprises at 
10 least a single chromosome (diploid or haploid) corresponding 
to donor DNA with the remaining chromosomes essentially 
derived from a background strain (i.e., consomic) . 
Preferably, such an animal/strain shall be homozygous for 
substantially all of the alleles contained in the donor 
15 chromosome (as verified by genome tagging) . 

Another embodiment of the present invention is a 
recombinant congenic animal that comprises a substantially 
contiguous (as opposed to mosaic) portion of between about 1 
to about 150 centimorgans of donor genome in the context of a 
2 0 different genetic background. 

Additionally, another embodiment of the present 
invention are inbred recombinant congenic strains that 
comprise substantially contiguous (as opposed to mosaic) 
segments (of about 1 to about 150 centimorgans) of donor 
25 parent DNA in the context of at least one different genetic 
background . 

Another embodiment of the present invention is the use 
of the recombinant congenic strains of the library to breed 
or construct animals, particularly mice, that are useful as 
30 human disease models, or for the testing of pharmaceuticals 
and new chemical entities. 

4.0. Detailed Description of the Invention 

The presently defined invention allows for a rapid and 
35 comprehensive method for associating a given phenotype with 
the underlying genetic basis for the phenotype. More 
particularly, the present invention describes methods and 
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rationales for producing a library of recombinant congenic 
"genome- tagged" animals. The individual animal strains that 
collectively constitute the library may each contain either 
mosaic (i.e., interspersed throughout the genome) or 
5 discrete/defined (substantially contiguous) 

percentages/segments of a "donor" strain genome that have 
been placed in the genetic context of the "background" 
strain. For the purposes of the present invention the term 
"donor strain" shall refer to the strain of animal 

10 contributing the minority of the genome of a given 

recombinant congenic strain, and the terms "background 
strain" or "recipient strain" shall refer to the strain of 
animal contributing the majority of the genome of a given 
recombinant congenic strain. 

15 The presently described recombinant congenic strains are 

generally constructed by outcrossing two different inbred 
parental strains followed by a series of back-crosses and an 
extended series of brother/sister matings to produce 
recombinant congenic animals that comprise the desired 

20 percentages of the donor and background genomes. For 

example, statistics predict that about 25 percent of genomes 
in offspring of a first back-cross will be derived from the 
"donor" genome. Subsequent rounds of back-crossing may be 
statistically predicted to produce offspring comprising about 

25 12.5, 6.25, 3.125, or 1.5625, etc. percent donor genome. 
After back-crossing, the genetic composition of the back- 
crossed offspring is typically characterized by an extended 
series of genetic screens. Animals bearing the desired 
genotypes are subject to an extended series of brother/sister 

3 0 matings in order to produce inbred recombinant congenic 
strains that are homozygous for the desired genotypes or 
" tagged" genomes . 

During back-crossing as well as the construction of 
recombinant congenic and congenic strains, a variety of 

35 screening processes may be used to confirm both the relative 
percentages of donor/background genetic information, as well 
as to determine the specific regions of donor and background 
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DNA contained within each recombinant congenic strain. 
Methods of screening that may be used to assess the genetic 
make-up of an animal include, but are not limited to: 
polymerase chain reaction, genome mismatch scanning, 
5 restriction site polymorphism, sequence analysis, DNA 

hybridization, restriction landmark genome scanning (RLGS) , 
restriction fragment length polymorphisms (RFLP) , single- 
stranded conformation polymorphisms (SSCP) , etc. 

By identifying the genetic markers present in a given 

10 animal, the above screening procedures essentially provide a 
genetic index for each animal tested. Thus, the fact that 
the individual animals/strains in the library have been 
comprehensively genome- tagged is a novel feature of the 
presently described libraries. 

15 For the purposes of the present disclosure, the terms 

"indexing" or "genome- tagged" refer to the processes, during 
screening, by which genetic markers are tracked, catalogued, 
or otherwise inserted in order to identify the portions of 
the donor genome present in a given animal or strain. Once 

20 animals with the desired genetic make-up are identified 

during the genetic screening process, the desired genotypes 
are rendered homozygous by brother/sister matings, thereby 
constructing the appropriate recombinant congenic strains. 
Subsequent rounds of the genome screening/tagging process are 

25 used to verify the desired genotypes and to ensure that 
essentially all of the donor genome is collectively 
represented in the library of recombinant congenic animals. 
For the purposes of the present invention, a "substantially 
genome tagged animal or strain" has effectively been screened 

30 to an extent that a genetic map has been generated that 

accounts for at least one genetic marker of interest in every 
contiguous 1 to about 100 centimorgan region of the genome, 
preferably for at least one genetic marker in every 2 to 
about 75 centimorgans of the genome, more preferably about 5 

35 to about 65 centimorgans, and specifically at least one 

genetic marker in about every 50 centimorgans of the genome. 
It shall be understood that a given animal or strain may not 
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need to be comprehensively screened/tagged over regions in 
which both parents "are known to homozygously encode the same 
genetic content, and thus a given animal/strain may be 
effectively "substantially genome tagged" even though only a 
5 portion of that animal or strain's genome has actually been 
genetically mapped. A given animal or strain shall be 
considered "comprehensively genome mapped" where one can 
account for at least 1 genetic marker in about every 1 to 10 
centimorgans of the genome, preferably about 2 to about 8 

10 centimorgans, and specifically about 4 to about 6 
centimorgans of the genome (excluding telomeres) . 

In order to facilitate the screening processes (e.g., 
genome mismatch scanning, etc.) used to genome tag the 
recombinant congenic animals, it is advantageous to use 

15 parental strains that are genetically divergent to increase 
the chances that polymorphisms may be detected that allow one 
to genetically distinguish the parental strains. For the 
purposes of the present invention the term "genetically 
divergent" shall mean that the animals of interest shall 

20 generally contain an average of at least about 0.25 percent 
genetic mismatch, preferably at least about 0.5 percent, and 
more preferably at least about 0.75 percent. Typically, 
"highly divergent" strains shall comprise genetic backgrounds 
that differ by at least about 1.0 percent (i.e., the DNA 

25 sequences of the strains differ by an average of at least 
about 1 out of every 100 nucleotides) . 

Typically, the library shall comprise, inter alia, 
recombinant congenic animals having about twelve (12) percent 
of their genomes derived from the 'donor strain. Where such 

30 animals are used to construct a library, the number of the 
animal strains necessary to generate a set of recombinant 
congenic strains that collectively represent an essentially 
complete copy of the "donor strain" genome shall vary from 
about nine strains to several dozen or more (i.e., hundreds) 

35 strains depending upon the percentage of donor genome in each 
strain and the extent of genetic overlap that one may wish to 
engineer into the recombinant congenic strains that 



- 6 - 



WO 98/23145 



PCT7US97/21360 



collectively constitute the library. For the purposes of the 
present invention, "the term "substantially complete" shall 
mean that a statistical analysis may show that there is at 
least about a 95 percent probability that at least about 95 
5 percent of the donor genome is collectively represented in a 
given library. 

Preferably, the library of recombinant congenic animals 
will comprise strains that contain between about 0.5 percent 
and about 12 percent of the donor strain genome, more 

10 preferably, between about 1 percent and about 8 percent, 

specifically between about 1.5 percent and about 6.2 percent, 
and more specifically about 2 percent to about 4 percent of 
the donor strain genome, or any mixture or combination of the 
above. Alternatively, the library may comprise recombinant 

15 congenic strains having between about 5 to about 100 

centimorgans , typically about 10 to about 80 centimorgans , 
preferably about 15 to about 75 centimorgans, more preferably 
about 25 to about 6 0 centimorgans of donor genome, and 
specifically about 50 centimorgans of donor genome. Given 

20 that, for example, a typical mouse genome has approximately 
1,500 centimorgans of genetic material, 50 centimorgans 
corresponds to approximately 3 percent of the total mouse 
genome . 

Depending upon the percentage of the donor genetic 
25 material transferred into the animals in the library, a 

corresponding number of overlapping strains are necessarily 
required to form a library of recombinant congenic animals 
that collectively represent a substantially comprehensive 
portion* of the donor strain genome. Accordingly, the 
30 libraries of the present invention may comprise up to many 
hundreds of recombinant congenic animals strains. It should 
also be understood that the above number may effectively 
double when a complementary (or "mirror image" library is 
constructed using recombinant congenic animals bearing 
35 substantially inverse proportions of genetic material from 
the "donor" and "recipient" strains. For the purposes of the 
present invention, the term "substantially inverse" shall 
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mean that similar methods have been used to construct a 
series of recombinant congenic strains that reverse the 
"donor" and "recipient" genetic components. 

The recombinant congenic strains of the presently 
5 described library are typically subject to an extensive 
series of genetic screens. The result of this screening 
process is that each strain in the library will have been 
analyzed and characterized over a region spanning essentially 
the entire genome. Accordingly, each strain in the library 

10 will have been comprehensively evaluated or "genome tagged" . 
Consequently, the results of the genetic analysis (or genome 
tagging) may be compiled and stored in a database that, inter 
alia, shall also serve as a genetic "index" to the 
recombinant congenic strains in the library. 

15 An additional feature of the genomic screening process 

is that recombinant congenic strains shall be identified that 
comprise defined or discrete regions of the donor genome. 
For the purposes of the present invention, a discrete region 
of genome shall mean that a substantially contiguous series 

20 of genetic markers shall have been derived from one of the 
parent genomes. Accordingly, recombinant congenic strains 
that comprise defined or discrete segments shall generally 
comprise at least about 60 percent, preferably about 75 
percent, more preferably at least about 90 percent, 

25 specifically at least about 95 percent of the net donor 

parent genome component (of the total genome) in one to about 
three regions of substantially contiguous genetic 
information. Accordingly, a particularly preferred 
embodiment of the present invention is a library where at 

30 least one subset of the strains in the library have 

substantially contiguous regions of donor genome, and where 
the substantially contiguous regions of donor genome 
contained in these strains overlap to an extent that the 
strains collectively form a substantially complete 

35 representation of the donor genome in the context of at least 
one background genome. For the purposes of the present 
invention, the term "substantially contiguous" shall mean 
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that at least about 90 percent of the genetic markers 
screened within the region correspond to a single parent 
strain genome. The term "contiguous" (region of genome) 
shall refer to the situation where all of the genetic markers 
5 screened within a given region correspond to a single parent 
strain genome . 

Alternatively, recombinant congenic strains used to 
generate the presently described library may also comprise 
genetic mosaics (genomes that intersperse a multiplicity of 

10 donor sequences within the background genomes) . 

In any event, the presently described library of 
recombinant congenic animals shall allow for the rapid and 
facile identification of the genetic features that are 
associated with a given phenotype, or multigenic or polygenic 

15 disorder. 

Although mice are specifically exemplified herein, the 
present invention is broadly deemed to have far reaching 
applications. Virtually any species of animal that is 
capable of being inbred to an extent that it is essentially 

20 homozygous at all loci may serve as the basis for a library 
of recombinant congenic animals. Accordingly, in addition to 
use in studying the genetic contribution to disease and 
disease susceptibility, fields such as animal husbandry, wild 
life management, and the breeding of domesticated animals as 

25 pets may be significantly advanced by the presently disclosed 
invention. For example, a library of recombinant congenic 
swine could be used (in conjunction with a database on the 
genetic tendencies of the offspring of particular crosses) to 
generate- animals with predictable and enhanced body and 

30 growth characteristics (i.e., percent body fat, rapid growth, 
resistance to specific diseases, enhanced skin, fur, or 
coats, etc.). Since the recombinant congenic strains of the 
present invention breed true, offspring generated using the 
library strains may rapidly be converted into strains/breeds 

35 that breed true. Accordingly, the present invention 

effectively provides a method for developing a wide variety 
of recombinant congenic or inbred strains of animals bearing 
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desirable and predictable genetic tendencies, that also 
remain capable of predictably and reliably reproducing 
offspring sharing the desired qualities. 

In view of the above, animals that may be used to 
5 construct the presently described recombinant congenic 
strains may include, but are not limited to, pigs, rats, 
rabbits, cattle, goats, fish, and birds (particularly turkeys 
and chickens) , as well as animal species, particularly 
mammalian species, that are known in the art. Additionally, 

10 bovine, ovine, and porcine species, other members of the 
rodent family, e.g. rat, as well as rabbit and guinea pig, 
and non-human primates, such as chimpanzee, may be used to 
practice the present invention. Particularly preferred 
animals are rats, rabbits, guinea pigs, and most preferably 

15 mice. 

Given that recombinant congenic mice are specifically 
exemplified herein, an additional embodiment of the present 
invention is a recombinant congenic mouse, or inbred strain 
thereof, that generally comprises between 1 to about 150 

20 centimorgans , and preferably comprises about 10 to about 80 
centimorgans of the donor genome as a substantially 
contiguous region (or regions where multiple chromosomes are 
involved) of donor genome. Preferably, recombinant congenic 
mice, or strains thereof, bearing the indicated substantially 

25 contiguous regions of donor genome will be the offspring 
arising after at least about 3 rounds, preferably at least 
about six rounds, and more preferably at least about a dozen 
rounds of brother- sister matings. In a particularly 
preferred embodiment, the presently described recombinant 

30 congenic mice will have been screened to verify the 

substantially contiguous nature of the donor parent genetic 
component, before, during, and after (or all or any 
combination thereof) the requisite series of brother-sister 
matings . 

35 A particularly preferred embodiment of the present 

invention includes the use of the presently described library 
of recombinant congenic strains to study the genetic basis of 
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disease. Present techniques of classical genetic analysis 
are generally limited to non-epistat ic genetic traits. 
Accordingly, classical genetic techniques are only of limited 
value in instances where a given disease or phenotype is 
5 dependent on the interaction of multiple genetic loci. The 
presently described libraries of recombinant congenic animals 
provide the tools necessary for a systematic analysis of the 
genetic component of a variety of animal diseases or 
developmental and behavioral characteristics. Additionally, 

10 because of the substantial amount of donor genetic material 
present in the various strains in the library, the presently 
disclosed strains are particularly well suited for the study 
of multigenic and polygenic genetic disorders or tendencies. 
In particular, the novel genetic content of the strains 

15 in the library should correlate with the observation of 

equally novel phenotypic tendencies (with or without measures 
to promote a given phenotype) . Additionally, it has been 
shown that a given animal strain may serve as a useful and 
informative human disease model when subject to the proper 

20 environmental stimuli or stress. Accordingly, an additional 
embodiment of the present invention includes the employment 
of a systematic method of identifying and subjecting an 
animal to a series of environmental stimuli or stresses in 
order to determine whether the animal, when subject to a 

25 specific stress or environment, develops symptoms or signs 
that are similar to, or associated with, a known disease in 
other animals including humans. Examples of such stresses or 
environmental stimuli may include, but are not limited to: 
specialized (e.g., high-protein or high fat, etc.) diets, 

30 exposure to toxic agents or chemicals, lethal or sub-lethal 
irradiation, drug treatments, hyperthermia or hypothermia, 
viral infection, bacterial infection, exercise tolerance and 
training, response to inhalation of various agents (e.g., 
smoke, asbestos, airborne chemical toxins, etc.), behavioral 

35 responses (e.g., maze training), circadian rhythm changes, 

physical shock, surgeries, behavioral stimuli, sound, natural 
and synthetic pharmaceuticals or chemical agents, exposure to 
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foreign DNA, immune challenge, immunocompromising agents or 
procedures, etc. 

Typically, more extensive and detailed phenotyping 
procedures are more likely to identify a relationship between 
5 a disease symptom and the responsible gene or genes. One of 
the most valuable but labor-intensive of the phenotypic 
measurements is that of temporal, environmental and tissue- 
specific patterns of gene expression in normal and diseased 
tissues. Combining such gene expression data with other 

10 phenotypic data, as well as the genotypic data from the 

library, provides a comprehensive approach to gene discovery. 

The selection of the proper challenge/stimulus may be 
done randomly, or may be consistent with previously 
identified disease associations (e.g., ulcers/helicobacter 

15 infection, Crohn's disease/bacteriodes or M. paratuberculosis 
infection, etc . ) . 

An additional feature of subjecting the recombinant 
congenic animals of the library to environmental or dietary 
stress or challenge is that genotypes are identified that 

20 correlate with an enhanced ability to avoid or substantially 
mitigate the adverse consequences normally associated with 
such stresses. Accordingly , another embodiment of the 
present invention relates to the identification and 
characterization of the genetic basis for an enhanced 

25 resistance to normally adverse stimuli (e.g., high fat diets, 
sleep deprivation, noxious inhalants, etc.). Additionally, 
the presently described methods and libraries are equally 
well suited for identifying the molecular basis for 
genetically determined advantages such as prolonged life- 

30 span, low cholesterol, low blood pressure, exercise 

tolerance, optimal respiratory quotient, low percent body 
fat, reduced bone absorption, increased memory, attenuation 
of sensitivity or prevention of inflammatory or autoimmune 
disorders, etc . 

35 An additional feature of the present invention is that 

it also provides means of identifying genes that do not 
actually cause a disorder, but modify or mediate the severity 
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of the phenotype or disorder. Examples of such situations 
include, but are not limited to, genes that alter age at 
disease onset, or genes that affect the progression and 
outcome of neoplasms (e.g., metastasis). 
5 Once a relevant disease pathway has been identified, 

other recombinant congenic strains in the presently described 
libraries may be similarly stressed to ascertain whether the 
disease symptoms correlate with a particular region (or 
regions) of the genome. Where such a correlation is seen, 

10 the overlapping and reductive nature of the recombinant 
congenic strains in the library allow for the rapid fine- 
structure genetic analysis of the disease system. 
Alternatively or additionally, after a given genomic region 
has been associated with a given phenotype, fine-structure 

15 genetic mapping of the phenotype may be conducted by using 
the relevant recombinant congenic animals to construct true 
congenic animals (by an additional series of back-crosses) in 
order to narrow the precise genetic interval containing the 
gene of interest. The resulting congenics animals may 

20 subsequently be used to more specifically identify and 

isolate the loci that are associated with the phenotype of 
interest . 

Given the sensitivity of the genetic tools available 
through the library, the eventual identification and 

25 isolation of the specific gene or genes that are associated 
with the disease symptoms may be accomplished more rapidly 
and efficiently. For example, recombinant congenic strains 
necessarily delimit the area of genetic analysis. Thus, once 
a phenotypic trait has been associated with a given 

30 recombinant congenic strain, one significantly reduces the 
time and expense necessary to positionally clone a given gene 
via established methodologies (e.g., chromosome walking, YAC 
construction, exon trapping, polynucleotide or cDNA 
hybridization or sequence analysis, etc.). 

35 Once a putative gene or other region of the chromosome 

has been isolated, the gene may subsequently be analyzed 
using established molecular biology techniques (i.e., 
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cloning, DNA sequencing, polymerase chain reaction, etc., see 
generally Sambrook- et al . (1989) Molecular Cloning Vols. I- 
III, Cold Spring Harbor Laboratory Press, Cold Spring Harbor, 
New York, and Current Protocols in Molecular Biology (1989) 
5 John Wiley & Sons, all Vols, and periodic updates thereof, 
herein incorporated by reference) . Additionally, once such 
sequences have been isolated and characterized, they may be 
used as heterologous probes to rapidly and efficiently 
identify and isolate related sequences in other organisms and 
10 animals. 

The presently disclosed methods, libraries, and 
databases are useful for the diagnosis, study, and treatment 
of a wide variety of diseases and disorders including, but 
not limited to: autoimmune disease, arthritis (e.g. 

15 rheumatoid and osteo) , atherosclerosis, obesity, 

osteoporosis, microbial or viral infection, inflammatory 
bowel diseases, diabetes (insulin dependant or non- insulin 
dependant), multiple sclerosis, cancer, Crohn's disease and 
ulcerative colitis, asthma, hypertension, dementia, 

20 Alzheimer's disease, stroke, hepatitis, chronic liver 
disease, peptic ulcers, gastritis, neurodegenerative 
disorders, neurological or psychiatric disorders, systemic 
lupus erythematosus, scleroderma, post -angioplasty vascular 
inflammatory response, etc. 

25 Cancers that may be diagnosed or treated using the 

presently described methods include, but are not limited to: 
Breast: sarcoma and carcinoma; Cardiac : sarcoma 
(angiosarcoma, fibrosarcoma, rhabdomyosarcoma, liposarcoma) , 
myxoma, rhabdomyoma, fibroma, lipoma and teratoma; Lung: 

30 bronchogenic carcinoma (squamous cell, undifferentiated small 
cell, undifferentiated large cell, adenocarcinoma) , alveolar 
(bronchiolar) carcinoma, bronchial adenoma, sarcoma, 
lymphoma, chondromatous hamartoma, mesothelioma; 
Gastrointestinal : esophagus (squamous cell carcinoma, 

35 adenocarcinoma, leiomyosarcoma, lymphoma) , stomach 

(carcinoma, lymphoma, leiomyosarcoma) , pancreas (ductal 
adenocarcinoma , insulinoma , glucagonoma , gastrinoma , 
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carcinoid tumors, vipoma) , small bowel (adenocarcinoma, 
lymphoma, carcinoid tumors, Karposi's sarcoma, leiomyoma, 
hemangioma, lipoma, neurofibroma, fibroma) , large bowel 
(adenocarcinoma, tubular adenoma, villous adenoma, hamartoma, 
5 leiomyoma) ; Genitourinary tract : kidney (adenocarcinoma, 
Wilm' s tumor [nephroblastoma], lymphoma, leukemia), bladder 
and urethra (squamous cell carcinoma, transitional cell 
carcinoma, adenocarcinoma) , prostate (adenocarcinoma, 
sarcoma) , testis (seminoma, teratoma, embryonal carcinoma, 

10 teratocarcinoma , choriocarcinoma, sarcoma, interstitial cell 
carcinoma, fibroma, fibroadenoma, adenomatoid tumors, 
lipoma) ; Liver : hepatoma (hepatocellular carcinoma) , 
cholangiocarcinoma , hepatoblastom, angiosarcoma, 
hepatocellular adenoma, hemangioma; Bone : osteogenic sarcoma 

15 (osteosarcoma) , fibrosarcoma, malignant fibrous histiocytoma, 
chondrosarcoma, Ewing's sarcoma, malignant lymphoma 
(reticulum cell sarcoma) , multiple myeloma, malignant giant 
cell tumor, chordoma, osteochronf roma (osteocartilaginous 
exostoses) , benign chondroma, chondroblastoma, 

20 chondromyxof ibroma, osteoid osteoma and giant cell tumors; 
Nervous system : skull (osteoma, hemangioma, granuloma, 
xanthoma, osteitis deformans) , meninges (meningioma, 
meningiosarcoma, gliomatosis) , brain (astrocytoma, 
medulloblastoma, glioma, ependymoma, germinoma [pinealoma] , 

25 glioblastoma multiforme, oligodendroglioma, schwannoma, 
retinoblastoma, congenital tumors) , spinal cord 
(neurofibroma, meningioma, glioma, sarcoma) ; Gynecological : 
uterus (endometrial carcinoma) , cervix (cervical carcinoma, 
pre-tumor cervical dysplasia) , ovaries (ovarian carcinoma 

30 [serous cystadenocarcinoma , mucinous cystadenocarcinoma , 
endometrioid tumors, celioblastoma, clear cell carcinoma, 
unclassified carcinoma] , granulosa- thecal cell tumors, 
Sertoli-Leydig cell tumors, dysgerminoma , malignant 
teratoma) , vulva (squamous cell carcinoma, intraepithelial 

3 5 carcinoma, adenocarcinoma, fibrosarcoma, melanoma) , vagina 
(clear cell carcinoma, squamous cell carcinoma, botryoid 
sarcoma [embryonal rhabdomyosarcoma] , fallopian tubes 
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(carcinoma) ; Hematologic : blood (myeloid leukemia [acute and 
chronic] , acute lymphoblastic leukemia, chronic lymphocytic 
leukemia, myeloproliferative diseases, multiple myeloma, 
myelodysplastic syndrome), Hodgkin' s disease, non-Hodgkin ' s 
5 lymphoma [malignant lymphoma] ; Skin : malignant melanoma, 
basal cell carcinoma, squamous cell carcinoma, Karposi's 
sarcoma, moles, dysplastic nevi, lipoma, angioma, 
dermatofibroma, keloids, psoriasis; and Adrenal glands : 
neuroblastoma . 

10 Given that the disclosed libraries allow for the rapid 

identification of the genetic basis for a disease, an 
additional aspect of the present invention is the 
identification and cloning of human homologues to genes 
identified using the presently disclosed libraries of 

15 recombinant congenic strains. Ideally, the human homologue 
of a given gene will hybridize with the heterologous probe 
under stringent conditions. For the purposes of the present 
invention "stringent conditions" generally refers to 
hybridization conditions that (1) employ low ionic strength 

20 and high temperature for washing, for example, 0.015 M 

NaCl/0.0015 M sodium citrate/0.1% SDS at 50°C; or (2) employ 
during hybridization a denaturing agent such as formamide, 
for example, 50% (vol/vol) formamide with 0.1% bovine serum 
albumin/0.1% Ficoll/0.1% polyvinylpyrrolidone/50 mM sodium 

25 phosphate buffer at pH 6.5 with 750 mM NaCl, 75 mM sodium 

citrate at 42°C; or (3) employ 50% formamide, 5 x SSC (0.75 M 
NaCl, 0.075 M Sodium pyrophosphate, 5 x Denhardt's solution, 
sonicated salmon sperm DNA (50 g/ml) , 0.1% SDS, and 10% 
dextran sulfate at 42°C, with washes at 42°C in 0 . 2 x SSC and 

30 0.1% SDS. The above examples of hybridization conditions are 
merely provided for purposes of exemplification and not 
limitation. A more thorough treatise of the such routine 
molecular biology techniques may be found in Sambrook et al . , 
Molecular Cloning , Cold Spring Harbor Laboratory Press, Cold 

35 Spring Harbor, New York, Vols. 1-3: (1989), and periodic 
updates thereof, herein incorporated by reference. 
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Once a genetic locus of interest has been identified, an 
additional embodiment of the present invention is the testing 
of the observed genetic correlation via the production of 
transgenic animals that have been engineered to be deficient 
5 in (or to over express) one or more alleles of the gene of 
interest. Typically, in the case of knockout transgenics, 
such transgenic animals are produced by using disruptive 
recombination in embryonic stem cells that are subsequently 
incorporated into blastocysts. Alternatively, in the case of 
10 expression-enhanced transgenics, DNA is injected into the 

male pronucleus of a fertilized egg and implanted into pseudo 
pregnant females. Such gene-modified animal cells and 
transgenic animals may be prepared using any of several 
techniques that are well established in the art. In 
15 particular, techniques may be used that are conceptually 

similar to those taught in U.S. Patent No. 5,464,764 issued 
to Capecchi and Thomas on November 7, 1995, herein 
incorporated by reference. 

Other embodiments of the present invention include cell 
20 lines derived from the recombinant congenic animals of the 
presently described libraries. Once a genotype has been 
associated with a disease, disorder, or advantage, the 
precise molecular basis of the disorder may be best analyzed 
at the cellular level. Accordingly, cell lines derived from 
25 tissues manifesting the adverse or advantageous genetic 
tendencies may be used to more particularly study the 
chemical basis for a given disease or advantage. 

In many instances, different cellular genotypes may 
exhibit markedly different patterns of gene expression, and 
3 0 different tissues within a genotype may also be 

differentially effected. Thus, an additional embodiment of 
the presently disclosed invention is the development of 
expression profiles for recombinant congenic cells and 
recombinant congenic animals with varying genetic content. 
35 Additionally, expression profiles may be determined for 
different tissues having the same genotype, or for similar 
tissues having the same genotype which were obtained from 
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animals that may or may not display a given phenotypic trait. 
For the purposes of the present invention, the term 
"expression profile" shall refer to a sequence database that 
provides quantitative and qualitative information regarding 
5 the genes expressed by a given animal , or cells obtained or 
derived therefrom. Generally, such profiles may be generated 
by isolating mRNA from ("affected" and "control") animal 
tissue or cells, reverse transcribing the mRNA, constructing 
a cDNA library, sequencing the clones in the cDNA library, 

10 and gathering and organizing, and analyzing the DNA sequence 
data. A comparison of the expression profiles for tissues or 
cells from animals with different genotypes provides a direct 
measure of the cellular consequences of the genetic variance. 
Alternatively, an expression profile may be generated by 

15 generating the appropriate cDNA library, and screening the 
cDNA library with differentially labeled RNA from normal and 
affected animal tissue. The clones whose expression are 
altered (in comparison to normal) are then identified, 
sequenced and further analyzed. 

20 Additionally, chemical mutagenesis may be used to 

effectively generate new alleles at a given genetic locus, 
and the expression profiles of mutagenized animals or cells 
may be screened as discussed above. 

A still further aspect of the invention relates to 

25 constructing subtract ion- libraries generated by using 

transcripts from nonstimulated or nonresponder samples to 
subtract transcripts that are not responsible for the trait 
of interest. Subtraction-libraries may be constructed from a 
source of interest using techniques which are standard in the 

30 art ( Molecular Cloning: A Laboratory Manual , 2d ed., 

Sambrook, J., Fritsch, E.F., andManiatis, T. (1989)). For 
example, a subtraction- library may be constructed that will 
identify those cDNAs which are "new" in a recombinant 
congenic strain exhibiting the trait of interest by 

35 subtracting out the cDNAs that are present in control cells. 
Such cDNAs constitute ideal probes for identification and 
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localization of the gene or genes responsible for the 
phenotypic trait of" interest. 

Alternatively, standard subtraction analysis may be 
combined with gene expression profiling of the genes that are 
5 turned on, turned off, or modulated in disease states. After 
a modulated gene has been identified and associated with a 
trait of interest, further analysis may reveal a coherent and 
detailed picture of the sequence of events that occur during 
the disease process. This approach provides a powerful 

10 complement to standard positional cloning methods. 

High- throughput and extremely sensitive assay systems 
allow for the construction of detailed, tissue-specific gene 
expression profiles in animals such as humans and mice. In 
particular, a technology platform based on automated 

15 differential gene expression analysis is exemplified. Such 
platforms have previously involved hybridizing cDNAs derived 
from "normal" and "abnormal" tissues to arrays of cDNAs from 
a "target" cDNA library. The data obtained from such a 
platform may then be used in further gene discovery efforts. 

20 The following examples are provided solely to illustrate 

the subject invention. Given the level of skill in the art, 
one may be expected to modify any of the above or following 
disclosure to produce insubstantial differences from the 
specifically described features of the present invention. As 

25 such, the following examples are provided by way of 

illustration and are not included for the purpose of limiting 
the invention in any way whatsoever. 

5.0. EXAMPLES 
30 5.1. Library construction 
5.1.1. Animal Strains 

In order to highlight the desired phenotypic 
effects and facilitate the genetic screening process, it is 
generally desirable to construct a genome tagged recombinant 
35 congenic library using relatively divergent animal strains. 

Where mice are used to construct a library, examples of 
such divergent strains may be found within the Mus musculus 

- 19 - 



WO 98/23145 



PCI7US97/21360 



group. For example, Mus musculus domesticus (Domesticus) and 
Mus musculus castaneus (Castaneus) are highly divergent mouse 
subspecies. In order to facilitate phenotypic screening, it 
is also generally desirable to utilize strains that have been 
5 previously characterized as possessing known phenotypic 
traits. Additionally, it is desirable to select donor and 
background strains on the basis of having many phenotypic 
differences between them (e.g., atherosclerosis 
susceptibility, obesity, bone density, etc.). 

10 Apart from its genetic distance, Castaneus is also 

phenotypically distinct from many domesticus strains. For 
example, Castaneus has a much higher bone density than the 
Domesticus strain C57BL/6, which would make a recombinant 
congenic animal from the two strains an ideal candidate model 

15 for a genetic study of osteoporosis and bone density 
differences . 

The following is a partial listing of some of the 
phenotypic characteristics of several Domesticus strains that 
may be used to construct the described libraries: 

20 

a. C57BL/6 

Susceptible to the development of atheromatous 
lesions on wall of aorta after 20 weeks on a high-fat diet 
(Thompson, 1968; Roberts and Thompson, Paigen, 1995) . 

25 Develop fatty streak-like lesions in the valve sinus region 
of the ascending aorta after 10-20 weeks on a diet enriched 
in saturated fat and cholesterol. After a further 15 weeks, 
fibro- fatty lesions are found with many of the 
characteristics of human atheromatous plaques (Stewart - 

30 Phillips and Lough, 1991, Atherosclerosis 50:211-218) . 
Develops non-insulin-dependent diabetes mellitus and 
hypertension when fed a high fat -high simple carbohydrate 
diet, whereas A/ J mice do not (Mills et al . , 1993, Am. Jour. 
Phys . 264:R73-8). Low plasma cholesterol at 12 and 24 weeks. 

35 Low plasma triglyceride levels and low plasma cholesterol 
(Jiao et al., 1990, Metabolism 39:155-160). Low serum 
ceruloplasmin levels in males but intermediate in females 
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(Meier and MacPike, 1968, Proc . Soc . Exp. Biol. Med. 

128:1185-1190) . High blood sugar (Nishimura, 1969, Exp. 

Animals (Japan) 18:147-157) . High liver tyrosine 

aminotransferase in fasted mice but low in C57BL/6~ob (Blake, 
5 1970, Int. J. Biochem. 1:361-370). Low calcium uptake by the 

heart (Mokler and Iturrian, 1973, Proc. Soc. Exp. Biol. Med. 

142:919-923). High aldehyde dehydrogenase and alcohol 

dehydrogenase activity compared with DBA/ 2 (Sheppard et al . , 

1968, Biochem. Genet. 2:205-212). Low basal levels of kidney 
10 catalase, superoxide dismutase, and renal glutathione 

reductase (Misra et al 1991, Toxicology Letters 58:121-33). 

High alcohol preference (Fuller, 1964, J. Comp. Physiol. 

Psychol. 57: 85-88; Rodgers, 1966, Psychosomat . Med. 28:498- 

513). Low "emotionality", high open-field exploration 
15 (Thompson, 1953, Can. J. Psychol. 7:145-155). High 

preference for sweet tasting substances (Lush, 1988, Genetic. 

Res. Camb. 53:95-99). Low susceptibility to audiogenic 

seizures (Deckard et al . , 1976, Develop. Psychobiol . 24:9- 

17) . 

20 

b. A/ J 

Can be made obese by a suitable diet (Fenton and 
Dowling, 1953, J. Nutrition 49:319-331). Does not develop 
non-insulin-dependent diabetes mellitus and hypertension when 

25 fed a high fat-high simple carbohydrate diet, whereas C57BL/6 
mice do (Mills et al . , 1993). High incidence of amyloidosis 
(Russell and Meier, 1966, In Biology of the Laboratory Mouse , 
2d, (Green, ed.), McGraw-Hill, New York, pp. 571-587). Low 
metabolic rate (Storer, 1967, Exp. Gerontol 2:173-182) . Low 

30 serum ceruloplasmin in males but intermediate in females 

(Meier and MacPike, 1968) . Responds by higher growth rate on 
high fat diets (Fenton and Carr, 1951, J. Nutrition 45:225- 
233) . High percent carcass lipid on a high-fat diet (West et 
al . , 1992). Develops autoimmune phenomena, immunological 

35 deficits with aging and autoimmunity following neonatal 

thymectomy (Yunis et al . , 1972, The thymus, autoimmunity and 
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the involution of the lymphoid system ", (ed. Sigel, M.M. and 
Good, R.A.), Thomas ,- Springfield, 111., pp. 62-119). 

C . DBA/ 2 J 

5 Susceptible to audiogenic seizures (Fuller and 

Sjursen, 1967, J. Hered. 58:135-140). Spontaneous calcified 
heart lesions progress with age - 90% of individuals affected 
by 1 year (Rings and Wagner, 1971, Lab. Animal Sci . 22:344- 
352) . Incidence of calcareous heart lesions high among some 

10 related strains (Di Paola et al . , 1964, Proc. Soc . Exp. Biol. 
Med. 115:496-497). Other lesions include malignant lymphoma 
and degenerative processes in the myocardium, skeletal 
muscle, subcutaneous adipose tissue, cornea and blood 
vessels. Lesions partly depend on diet (Hare and Stewart, 

15 1956, J. Nat. Cancer Inst. 16: 889 -911 ) . High metabolic rate 
(Storer, 1967, Exp. Gerontol 2:173-182). High concentration 
of epinephrine and norepinephrine in adrenals (Ciaranello et 
al., 1972, Life Sci. 11 (1) : 565 -572 ) . Low liver tyrosine 
aminotransferase activity in fasted mice (Blake, 1970) . High 

20 calcium uptake by the heart (Mokler and Iturrian, 1973) . Low 
aldehyde and alcohol dehydrogenase activity compared with 
C57BL/6 (Sheppard et al . , 1968). 

d. C3H/HEJ 

25 Can be made obese by a suitable diet (Fenton and 

Dowling, 1953). Low blood pressure (Mull ink et al . , 1975, 
Personal Communication) . High plasma cholesterol and 
triglycerides. High erythrocyte catalase (Hoffman and 
Rechcigl, 1971, Enzyme 12:219-2225) . Low metabolic rate 

30 (Pennycuik, 1967, Aust. J. Biol. Med. Sci. 45:331-346). Low 
adrenal corticosteroid production (Nandi et al . , 1967, 
Endocrinology 80:576-582) . Low basal level of renal 
glutathione S- transferase but high basal level of renal 
glutathione reductase (Misra et al . 1991). Has poor immune 

35 response to endotoxic lipopolysaccharide due to a B-cell 
deficit (Rosenstreich and Glode, 1975, J. Immunol. 115:777- 
780; Coutinho, 1976, Scand. J. Immunol. 5:129-140). High 
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liver tyrosine aminotransferase level in fasted mice (Blake, 
1970) . Good short-term but poor long-term memory in contrast 
with DBA/ 2 (Bovet et al . , 1969, Psychopharmacologia 10:1-5). 

5.1.2. Methods Used to Produce Recombinant Congenic 
Animals _ 

Standard genetic outcrosses are used to initially cross 

the parental strains to produce offspring that are comprised 

of 50 percent of each parental genome. Back-crossing the 

offspring to the relevant parent results in offspring with 

genomes that are 25 percent "donor" -derived. Subsequent 

rounds of back-crossing are used to produce successive rounds 

of offspring that reduce the amount of donor-derived genetic 

information by one-half per each back-cross. Once offspring 

with the appropriate genetic content are identified, an 

extended series of brother-sister matings are used to "fix" 

the genotype by generating recombinant congenic strains that 

are homozygous for the desired genotype. The resulting 

strains are subsequently rescreened to verify the qualitative 

and quantitative features of their respective genotypes. 

5.1.3. Methods Used to Analyze Recombinant Congenic 
Animals 

Previously, back-crosses and intercrosses have been used 
to locate loci affecting a wide variety of phenotypes. 
However, fine genetic mapping of the loci of interest 
requires the generation of hundreds -to -thousands of progeny, 
which are each genotyped and phenotyped. Using molecular 
biology techniques to screen for the presence and 
organization of the relevant genetic tags markedly simplifies 
and expedites the screening procedure, and allows for the use 
of methods of data generation and storage that are readily 
adaptable to automation. A particularly preferred method of 
genotyping that is well suited for automation is 
microsatellite mapping . 

Microsatellites are highly polymorphic genetic elements 
that are present at high density throughout all mammalian 
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genomes studied. Also known as simple sequence repeats (or 
SSR) , microsatellites consist of mono-, di-, tri- or 
tetrameric sequences repeated multiple times in a tandem 
array. Allelic variation arises from differences in the 
5 number of repeats present in a given tandem array, which 
results in variation in the total length of the 
microsatellite . Typically, the length of a satellite is 
measured by PCR amplification of the microsatellite followed 
by gel electrophoresis of the product to determine the size. 
10 Unique nucleotide sequences on each side of the repeat are 
chosen as primers for the PCR amplification reaction. The 
unique flanking sequences also serve to locate, or map, a 
particular microsatellite on the genome studied. In this way 
a microsatellite marker becomes a tag for that position on 
15 the genome. A large number of simple sequence length 
polymorphisms or microsatellite markers have been mapped 
throughout the mouse and human genomes (Dietrich et al . , 
1996, Nature 380:149-152; Dib et al . , 1996, Nature 380:152- 
154 both of which are herein incorporated by reference) , 
2 0 thereby generating a linkage map of the markers for each 
chromosome in the genome . 

Thus, an animal may be genotyped by screening and 
tagging the parental alleles that are present in the animal's 
genome. Typically, a genome is initially tagged using 
25 several hundred of the polymorphic microsatellite markers 
that effectively cover the entire genome. This method has 
already been used to genotype several different strains of 
mice (Whitehead/MIT Genome Center database) . 

In constructing the libraries, the genotype of an 
30 offspring is determined as described above and compared to 
the genotype of the two parental strains using about 100-400 
microsatellite markers, preferably about 350, that are 
dispersed throughout the genome. By this means, offspring 
are identified that contain the desired percentage of the 
35 donor parent genome, and the desired region or regions of the 
donor parent genome. These offspring are subsequently 
sibling-mated to construct a recombinant congenic strain that 
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is homozygous for the desired genotype. The donor region in 
each strain is finely mapped using either existing or newly 
identified markers. Such mapping is used to define the 
boundaries of the chromosomal region from the donor parent 
5 genome, to further confirm that the donor region is 

contiguous, and to create a fine genetic map of the donor 
genome segment . 

It must also be noted that at any step during the above 
process, the offspring may be phenotyped for any trait of 

10 interest. Offspring may be chosen for further backcrossing 
and sibling-mating if they display a desired phenotype . 

Alternatively, the recombinant congenic strains are 
constructed, screened, and used to assemble a genome tagged 
library comprising strains having overlapping regions of 

15 donor genome (which may or may not be contiguous or 

substantially contiguous) , in the context of at least one 
background genome. The animals in the library may then be 
used in phenotypic screening studies. 

20 5.1.4. Quantitative Trait Locus Mapping 

Quantitative trait locus mapping (QTL) is a method that 
has been successfully used for the analysis of many traits, 
including the identification of modifying genes. This method 
is desirable because of the availability of numerous 

25 polymorphic markers spanning the entire mouse genome. The 
approach involves phenotyping backcross (or F2) progeny for 
the trait of interest as well as genotyping with genetic 
markers throughout the genome. Typically, nucleic acid 
samples obtained from the subject animals are genetically 

30 typed and analyzed for association of parental alleles with 
phenotypes of interest . 



35 
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5.2. Methods of Using the Library to "identify the Genetic 

Basis for Disease, or the Genetic Basis for a Phenotypic 
Advantage 

5.2.1. Methods of Screening for Phenotypic Effects 

5 Typically, a wide variety of well established 

animal handling and feeding methods are used to screen 
recombinant congenic libraries for specific phenotypic 
effects. Generally, the animals in the library are observed 
in order to identify any inherent phenotypic traits. Since 

10 the animals in the library have been tagged and genetically 
catalogued during screening, virtually any observed phenotype 
may be rapidly correlated with a given genotype or particular 
region of the donor or background genome. Additionally, a 
representative group of animals from the library may be 

15 subject to an appropriate stimuli prior to screening for 
animals that exhibit distinctive phenotypic traits. 

In addition to traditional stimuli such as controlling 
diet and the environment, more aggressive techniques may be 
used such as 1 , 2 -dimethylhydrazine injection to study the 

20 genetics of susceptibility to formation of aberrant crypt 
foci and colon adenomas (Moen et al . 1996, Can. Res., 
56 (10) :2382-6) . 

As another example, Mori et al . , 1995, Genomics 
25(3): 609-14 used recombinant congenic (CcS/Dem) strains 

25 derived from inbred mouse strains BALB/cHeA (susceptible) and 
STS/A (resistant) to study the genetics of susceptibility to 
radiation- induced apoptosis of thymocytes after whole-body 
irradiation . 

30 5.2.1. Methods of Generating Expression Profiles 

After phenotypic screening has identified a 
recombinant congenic strain of interest is identified in the 
library, one can further analyze the regulatory consequences 
of the genotype by expression profile mapping. Typically, 
35 expression profiles are generated by obtaining mRNA samples 
from the relevant tissues in an appropriately stressed 
animal, generating cDNA, and using DNA sequence analysis to 
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quantitatively and qualitatively assess gene expression in 
the affected tissues. Such expressed sequence tag (EST) 
expression profiles are useful as a comparative tool versus 
expression profiles obtained from unstimulated or unaffected 
5 tissues or strains . 

a) Methods of Using Expression Profiles 

The sequence data comprising the expression profile 
may be used to generate an electronic relational database 

0 that may be organized by cataloging the quantity of clones 
corresponding to a given gene as well as the variety of genes 
represented in the expression profile. Database programs 
that are particularly deemed useful for the above analysis 
may be adapted from commercially available software such as 

5 that available from Oracle or Informix, for example. 

Of particular interest is the use of sequence, or 
sequence data, obtained from an non-stimulated, or 
nonresponder "control" animal to qualitatively or 
quantitatively "subtract" sequences that are not uniquely 

0 represented or over-represented in the strain exhibiting the 
desired trait or phenotype . Such substraction analysis will 
drastically reduce the fine structure genetic analysis of the 
phenotype at issue. 

5 EQUIVALENTS 

The foregoing specification is considered to be 
sufficient to enable one skilled in the art to broadly 
practice the invention. Indeed, various modifications, 
variations, and combinations of the above-described methods 

0 for carrying out the invention which are obvious to those 
skilled in the fields of genetics, microbiology, 
biochemistry, organic chemistry, medicine or related fields 
are intended to be within the scope of the following claims. 
All patents, patent applications, and publications cited are 

5 herein incorporated by reference. 
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What is claimed is : 

1. A library of non-human recombinant congenic animals, 
comprising: substantially genome tagged animals having 
genomes containing about 10 to about 80 centimorgans of donor 

5 parent genome . 

2. The library of claim 1 wherein said donor parent 
genome is substantially contiguous. 

3 . The library of claim 1 wherein said animals are 
mammalian . 

10 4. The library of claim 3 wherein said mammalian 

animals are rodents. 

5. The library of claim 4 wherein said rodents are a 

mice . 

6. The library of claim 5 wherein said mice 
15 collectively represent a substantially complete 

representation of the donor parent genome. 

7. A non-human recombinant congenic animal having a 
genome comprising about 1 to about 5 percent donor parent 
genome as a substantially contiguous region. 

20 8. The recombinant congenic animal of claim 7 wherein 

said animal is a mammal. 

9. The recombinant congenic animal of claim 8 wherein 
said mammal is a rodent . 

10. The recombinant congenic animal of claim 9 wherein 
25 said rodent is a mouse. 

11. An inbred strain of the recombinant congenic mouse 
of claim 10 . 
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